Performance Analysis of Speaker Identification System Using GMM with VQ
Abstract
Verifying personal identity is an important requirement for controlling access to protected resources. Biometric identification, which relies on distinctive features of a person, offers a more secure solution. Advances in speech processing technology and digital signal processors have made it possible to design high-performance, practical speaker recognition systems. A more flexible speaker identification system operates without explicit user cooperation and independently of the spoken utterance (text-independent mode). This paper proposes a system for text-independent speaker identification that extracts MFCC features and applies optimized GMM speaker modeling. The Expectation Maximization (EM) algorithm is used to compute the GMM parameters. The performance of the proposed system is evaluated in terms of identification accuracy and compared with that of a system using the VQ speaker modeling technique. A 100-speaker set from the TIMIT database is used to study the performance of the proposed system.
Key terms: feature extraction, speaker modeling, vector quantization (VQ), speaker identification, Mel-frequency cepstral coefficients (MFCC), Gaussian mixture model (GMM), Gaussian mixture model-Expectation maximization (GMM-EM)
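As a rough illustration of the comparison described in the abstract, the sketch below trains one diagonal-covariance GMM per speaker with EM, scores test frames by average log-likelihood, and does the same with a VQ codebook scored by average distortion. It is not the paper's implementation: random synthetic vectors stand in for MFCC features (no TIMIT data is used), plain k-means stands in for the codebook training, and every name and parameter (fit_gmm_em, fit_vq_codebook, var_floor, the 12-dimensional features, 4 components) is a hypothetical choice made for the example.

```python
import numpy as np

def log_weighted_densities(X, weights, means, variances):
    """Return log(w_k * N(x_n | mu_k, diag(var_k))) as an (n, k) array."""
    diff = X[:, None, :] - means[None, :, :]
    maha = (diff ** 2 / variances[None, :, :]).sum(axis=2)
    log_det = np.log(variances).sum(axis=1)
    d = X.shape[1]
    return np.log(weights) - 0.5 * (d * np.log(2 * np.pi) + log_det + maha)

def logsumexp(a, axis):
    m = a.max(axis=axis, keepdims=True)
    return (m + np.log(np.exp(a - m).sum(axis=axis, keepdims=True))).squeeze(axis)

def fit_gmm_em(X, k, n_iter=50, seed=0, var_floor=1e-3):
    """Fit a diagonal-covariance GMM to frames X (n_frames, n_dims) with EM."""
    rng = np.random.default_rng(seed)
    n = X.shape[0]
    means = X[rng.choice(n, size=k, replace=False)].copy()
    variances = np.tile(X.var(axis=0) + var_floor, (k, 1))
    weights = np.full(k, 1.0 / k)
    for _ in range(n_iter):
        # E-step: posterior probability of each component for each frame.
        log_p = log_weighted_densities(X, weights, means, variances)
        resp = np.exp(log_p - logsumexp(log_p, axis=1)[:, None])
        # M-step: re-estimate weights, means, and variances.
        nk = resp.sum(axis=0) + 1e-10
        weights = nk / n
        means = (resp.T @ X) / nk[:, None]
        variances = (resp.T @ X ** 2) / nk[:, None] - means ** 2
        variances = np.maximum(variances, var_floor)
    return weights, means, variances

def avg_log_likelihood(X, model):
    return logsumexp(log_weighted_densities(X, *model), axis=1).mean()

def fit_vq_codebook(X, k, n_iter=20, seed=0):
    """Train a VQ codebook with plain k-means (Lloyd iterations)."""
    rng = np.random.default_rng(seed)
    codebook = X[rng.choice(len(X), size=k, replace=False)].copy()
    for _ in range(n_iter):
        dists = ((X[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
        nearest = dists.argmin(axis=1)
        for j in range(k):
            members = X[nearest == j]
            if len(members):
                codebook[j] = members.mean(axis=0)
    return codebook

def vq_distortion(X, codebook):
    dists = ((X[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    return dists.min(axis=1).mean()

def identify_gmm(X, models):
    # Pick the speaker whose GMM gives the highest average log-likelihood.
    return max(models, key=lambda s: avg_log_likelihood(X, models[s]))

def identify_vq(X, codebooks):
    # Pick the speaker whose codebook gives the lowest average distortion.
    return min(codebooks, key=lambda s: vq_distortion(X, codebooks[s]))

def synthetic_frames(rng, centers, n_frames):
    # Assumption: stand-in for MFCC frames, drawn around speaker-specific centers.
    idx = rng.integers(len(centers), size=n_frames)
    return centers[idx] + 0.5 * rng.standard_normal((n_frames, centers.shape[1]))

rng = np.random.default_rng(42)
speakers = {f"spk{i}": rng.normal(scale=3.0, size=(4, 12)) for i in range(3)}
train = {s: synthetic_frames(rng, c, 400) for s, c in speakers.items()}
test = {s: synthetic_frames(rng, c, 100) for s, c in speakers.items()}

gmms = {s: fit_gmm_em(X, k=4) for s, X in train.items()}
codebooks = {s: fit_vq_codebook(X, k=4) for s, X in train.items()}

gmm_hits = sum(identify_gmm(X, gmms) == s for s, X in test.items())
vq_hits = sum(identify_vq(X, codebooks) == s for s, X in test.items())
print(f"GMM correct: {gmm_hits}/{len(test)}, VQ correct: {vq_hits}/{len(test)}")
```

The synthetic speakers here are far easier to separate than real voices, so the counts printed by this sketch say nothing about the accuracy figures of the paper. With real data, the synthetic_frames helper would be replaced by MFCC extraction from the recordings.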
Similar resources
GMM Based on Local Robust PCA for Speaker Identification
ABSTRACT: To solve the problems of outliers and high dimensionality of training feature vectors in speaker identification, in this paper, we propose an efficient GMM based on local robust PCA with VQ. The proposed method firstly partitions the data space into several disjoint regions by VQ, and then performs robust PCA using the iteratively reweighted covariance matrix in each region. Finally, ...
Speaker Identification Using Gaussian Mixture Models
In this paper, the performance of Perceptual Linear Prediction (PLP) features is compared with that of Linear Prediction Coefficient (LPC) features for speaker identification. Two classification techniques, Gaussian Mixture Models (GMM) and Vector Quantization (VQ) with Dynamic Time Warping (DTW), are used for classification of speakers based on their speech samples into respec...
Two-stage speaker identification system based on VQ and NBDGMM
In this paper, a new speaker identification system is presented. The system can be divided into two subsystems: a closed-set speaker identification system and a speaker verification system. The VQ model is used in the closed-set speaker identification system, and a new method called NBDGMM (Normalization Based on Difference of GMM) is introduced. Experiments have been done to prove that this s...
Real-time speaker identification
In speaker identification, most of the computation originates from distance or likelihood computations between the feature vectors of the unknown speaker and the models in the database. The identification time depends on the number of feature vectors, their dimensionality, the complexity of the speaker models and the number of speakers. In this paper, we focus on optimizing vector quantization ...
Using Exciting and Spectral Envelope Information and Matrix Quantization for Improvement of the Speaker Verification Systems
Speaker verification from a few spoken words or sentences has many applications. Many methods, such as DTW, HMM, VQ, and MQ, can be used for speaker verification. We applied MQ for its precise, reliable, and robust performance and its computational simplicity. We also used pitch frequency and log gain contour to further improve system performance.